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Abstract 

We show the existence of rigid combinatorial objects which previously were not known to exist. 
Specifically, for a wide range of the underlying parameters, we show the existence of non-trivial orthog- 
onal arrays, f-designs, and f-wise permutations. In all cases, the sizes of the objects are optimal up to 
polynomial overhead. The proof of existence is probabilistic. We show that a randomly chosen such 
object has the required properties with positive yet tiny probability. The main technical ingredient is a 
special local central limit theorem for suitable lattice random walks with finitely many steps. 

1 Introduction 

We introduce a new framework for establishing the existence of rigid combinatorial structures, such as 
orthogonal arrays, f-designs and f-wise permutations. Let B be a finite set and let V be a vector space of 
functions from B to the rational numbers Q. We study when there is a small subset T C B satisfying 

mI/W = 4tI/(^) for all /in v. (1) 

I-* I teT 1^1 beB 

In probabilistic terminology, equation ([T]) means that if f is a uniformly random element in T and b is a. 
uniformly random element in B then 

E[f{t)]=E[f{b)] for all/ in V, (2) 

where E denotes expectation. Of course, ([T]) holds trivially when T = B. Our goal is to find conditions on 
B and V that yield a small subset T that satisfies ([U, where in our situations, small will mean polynomial in 
the dimension of V. (In many natural problems one might encounter a function space V over M or C instead. 
However, since ([T]l is a rational equation, we can always reduce to the case of rational vector spaces.) 

Our main theorem. Theorem 12.11 gives sufficient conditions for the existence of a small subset T sat- 
isfying ([Til. We apply the theorem to establish results in three interesting cases of the general framework: 
orthogonal arrays, f-designs, and f-wise permutations. These are detailed in the next sections. Our methods 
solve an open problem, whether there exist non-trivial f-wise permutations for every t. They strengthen 
Teirlinck's theorem UTeiSVII . which was the first theorem to show the existence of f-designs for every t. And 
they improve existence results for orthogonal arrays, when the size of the alphabet is divisible by many 
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distinct primes. Moreover, in all three cases considered, we show the existence of a structure whose size is 
optimal up to polynomial overhead. 

Our approach to the problem is via probabilistic arguments. In essence, we prove that a random subset 
of B satisfies equation ([Hi with positive, albeit tiny, probability. Thus our method is one of the few known 
methods for showing existence of rare objects. This class includes such other methods as the Lovasz local 
lemma IIEL751 and Spencer's "six deviations suffice" method |Spe85|. However, our method does not 
rely on these previous approaches. Instead, our technical ingredient is a special version of the (multi- 
dimensional) local central limit theorem with only finitely many available steps. Since only finitely many 
steps are available, and since we can only gain access to more steps by increasing the dimension of the 
random walk, we cannot use any "off the shelf" local central limit theorem, not even one enhanced by a 
Berry-Esseen-type estimate of the rate of convergence. Instead, we prove the local central limit theorem that 
we need directly using Fourier analysis. Section [L4] gives an overview of our approach. 

We also mention that efficient randomized algorithm versions of the Lovasz local lemma IIMos091lMT10ll 
and Spencer's method IB an 101 have recently been found. Relative to these new algorithms, the objects that 
they produce are no longer rare. Our method is the only one that we know that shows the existence of rare 
combinatorial structures, which are still rare relative to any known, efficient, randomized algorithm. 



1.1 Orthogonal arrays 

A subset T C [q]" is an orthogonal array of alphabet size q, length n and strength t if it yields all strings 
of length t with equal frequency if restricted to any t coordinates. In other words, for any distinct indices 
/i , . . . , /f G [«] and any (not necessarily distinct) values vi , . . . , v, G [q], 

\{x^ T =vi,...,Xi, =Vt}\ =q^'\T\. 

Equivalently, choosing x = {xi, . . . ,Xn) € T uniformly, the distribution ofx ^ [g]" is f-wise independent. For 
an introduction to orthogonal arrays see IIHSS99II . 

Orthogonal arrays fit into our general framework as follows. We take B to be [q]" and V to be the space 
spanned by all functions of the form 

[l .x; = v; for all / G / 

f(^I){xi,...,Xn) = < . , (3) 

10 Otherwise 

with / C [«] a subset of size t and v G [q]'. With this choice, a subset T C B satisfying ^ is precisely an 
orthogonal array of alphabet size q, length n and strength t. 

t /I 

It is well known that if T C \q]" is f-wise independent then |r| > (^)^ for some universal constant 
c > (see, e.g., IIRao731 ). Matching constructions of size |r| < q'^' (j)''' are known, however, as these rely 
on finite field properties the constant Cq generally tends to infinity with the number of prime factors of q. 
Our technique provides the first upper bound on the size of orthogonal arrays in which the constant in the 
exponent is independent of q. 

Theorem 1.1 (Existence of orthogonal arrays). For all integers q>2,n>\ and I <t <n there exists an 
orthogonal array T of alphabet size q, length n and strength t satisfying \T\ < {qnY' for some universal 
constant c > 0. 



1.2 Designs 

A (simple) t-(y,k,X) design is a family of distinct subsets of [v], where each set is of size k, such that each t 
elements belong to exactly A sets. In other words, denoting by [^] the family of all subsets of [v] of size k, a 
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set T C [^] is a f-design if for any distinct elements .. ,it G [v], 

\{seT:h,...,Ues]\ = ^\T\=X. (4) 

For an introduction to combinatorial designs see iCDOTl. 

Our general framework includes ;-designs as follows. We take B to be [^] and V to be the space spanned 
by all functions of the form 

fa{b) = [' , (5) 

1 Otherwise 

with a G [J"] . With this choice, a subset T C B satisfying ([T]) is precisely a simple t — {v,k,X) design. 

Although ^-designs have been investigated for many years, the basic question of existence of a design 
for a given set of parameters t,v,k and A remains mostly unanswered unless t is quite small. The case 
f = 2 is known as a block design and much more is known about it than for larger t. Explicit constructions 
of f-designs for f > 3 are known for various specific constant settings of the parameters (e.g. 5-(12,6, 1) 
design). The breakthrough result of Teirlinck IITei87l was the first to establish the existence of non-trivial 
^-designs for t >1. In Teirlinck's construction, k = t + 1 and v satisfies congruences that grow very quickly 
as a function of t. Other sporadic and infinite examples have been found since then (see IICD07II or ]Mag09 1 



and the references within), however, the set of parameters which they cover is still very sparse. Moreover, 
it follows from ^ that any t — {v,k,?i) design T has size |r| = A (^^) / (^) > {v/k)'. Even when existence 
has been shown, the designs obtained are often inefficient in the sense that their size is far from this lower 
bound. One of the main results of our work is to establish the existence of efficient ^-designs for a wide 
range of parameters. 

Theorem 1.2 (Existence of f-designs). For all integers v> I, I <t <v and t <k <v there exists a t-{v,k,X) 
design whose size is at most v'^ for some universal constant c > 0. 

1.3 Permutations 

A family of permutations T C Sn is called a t-wise permutation if its action on any f-tuple of elements is 
uniform. In other words, for any distinct elements i\,.. . ,it G [n] and distinct elements ji,...,jt G [n], 

\{neT:nii,)=n,...Mir) = jr}\ = ^^^_^^_\^_^^^f \. (6) 

Our general framework includes f-wise permutations as follows. We take B = Sn and V to be the space 
spanned by all functions of the form 



1 b{h)= i\,---,b{h)= it 
Otherwise 



where / = {i\,...,it) and j = ( ji , . . . , ) are f-tuples of distinct elements in [n] . With this choice, a subset 
T C B satisfying ^ is precisely a f-wise permutation. 

Constructions of families of t-wise permutations are known only for f = 1 , 2, 3: the group of cyclic shifts 
xh^ x + a modulo « is a 1-wise permutation; the group of invertible affine transformations x^ ax + b over 
a finite field F yields a 2- wise permutation; and the group of Mobius transformations x i— )• {ax + b)/{cx + d) 
with ad — be = I over the projective line F U {oo} yields a 3-wise permutation. For t>4 (and n large enough), 
however, no f-wise permutation is known, other then the full symmetric group 5„ and the alternating group 
A„ IIKNR051IALTTI . In fact, it is known (c.f., e.g., IICam95J . Theorem 5.2) that for « > 25 and f > 4 there 
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are no other subgroups of Sn which form a f-wise permutation. (On other words, there are no other t- 
transitive subgroups of Sn for f > 4 and « > 25.) One of our main results is to show existence of small t-wise 
permutations for all t. 

Theorem 1.3 (Existence of f-wise permutations). For all integers n> I and I <t <n there exists a t-wise 
permutation T C S„ satisfying \T\ < exp{t'^)n^' for some universal constant c > 0. 

It is clear from the definition ^ above that any ?-wise permutation T must satisfy |r| >n{n — l)---{n — 
t + \) = n^^'\ Thus, for fixed t, the f-wise permutations we exhibit are of optimal size up to polynomial 
overhead. For t growing with n these f-wise permutations may be larger, but still no larger than n'^ for some 
universal constant c > 0. 

1.4 Proof overview 

The idea of our approach is as follows. Let T be a random multiset of B of some fixed size chosen by 
sampling B uniformly and independently N times (with replacement). Let {<pa)aeA be a spanning set of 
integer-valued functions for V (where A is some finite index set). Observe that T satisfies ^ if and only if 

for all a in A. (7) 

Thus defining an integer-valued random variable 

Xa ■= £ 0.(0 

teT 

and X := {Xa)aeA £ we see that existence of a subset of size N satisfying ^ will follow if we can show 
that P[X = E[X]] > 0. To this end we examine more closely the distribution of X. Let ti,... ,t]^he the random 
elements chosen in forming T. The spanning set {<pa)aeA defines a mapping (p : B hy the trivial 

Observe that our choice of random model implies that the vectors (0 are independent and identically 

distributed. Hence, 

^ = £0(fO (8) 

i 

may be viewed as the end position of an A^-step random walk in the lattice Z'^^L Thus we may hope that if 
is sufficiently large, then X has an approximately (multi-dimensional) Gaussian distribution by the central 
limit theorem. If the relevant local central limit theorem holds as well, then the probability F[X = x] also 
satisfies a Gaussian approximation. In particular, since a (non-degenerate) Gaussian always has positive 
density at its expectation, we could conclude that F[X = ]E[X]] > as desired. 

The above description is the essence of our approach. The main obstacle is, of course, pointed out in 
the last step. We must control the rate of convergence of the local central limit theorem well enough that 
the convergence error does not outweigh the probability density of the Gaussian distribution at K[X]. Recall 
that the order of magnitude of such a density is typically c^''*' for some constant c > 1, and recall that 
|A| is at least the dimension of V, which is the main parameter of our problem. So we indeed have very 
small probabilities. For this reason, and because we want convergence when A'^ is only polynomial in the 
dimension of V, we were unable to use any standard local central limit theorem. Instead, we develop an ad 
hoc version using direct Fourier analysis. 

In our proof of the main theorem, we modify the above description in one respect. It is technically 
more convenient to work with a slightly different probability model. Instead of choosing T as above, we set 



A^ 



teT 



beB 



teT 
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p := N/\B\ and define T by taking each element of B into T independently with probability p. This has the 
benefit of guaranteeing that T is a proper set instead of a multiset. However, it has also the disadvantage 
that it does not guarantee that \T\ = N. To remedy this, we assume that the space V contains the constant 
function h{b) = I; or if not, we can add it to V at the minor cost of increasing the dimension of V by 1 . With 
this assumption, we note that 

E[l^hit)]=n\T\]=N. 

teT 

Thus (I7]l, or equivalently X = E[X], also implies that \T\ = N as required. Another disadvantage is that in 
this new probability model, the vector X is no longer a sum of identically distributed variables. However, 
since the summands in ([Hi are still independent, we can continue to use Fourier analysis methods in our 
proof. 

We cannot expect there to always be a small subset T that satisfies (dJ. For instance, Alon and Vu IIAV97II 
found a regular hypergraph with n vertices and n"/^ edges, with no regular sub-hypergraph. Here, the 
degree of a vertex is the number of hyperedges incident to it and a regular hypergraph is one in which the 
degrees of all vertices are equal. We may describe their example in our language by letting B be the set of 
edges of this hypergraph, A be its vertex set, and define : B — )■ {0, 1}'* by letting be the indicator 
function of the set of vertices incident to b. The result of IIAV97II implies that while the vector LfceB0(^) 
is constant, this property is not shared by Y.teT^{t) for ^^ly non-empty, proper subset T C B. Thus, we 
need to impose certain conditions on B and V, or equivalently on the map (p. We start by requiring certain 
divisibility, boundedness and symmetry assumptions. 

Divisibility: N is such that T.heB (^) is an integer vector. This property is clearly necessary for ^ to 
hold and is typically a mild restriction on N. 

Boundedness: The entries of <p must be small. More precisely, maXaeA.heB |0(^)a| is bounded by a poly- 
nomial in dimV, since our method requires N to be at least some polynomial in this maximum. 

Symmetry: A symmetry of <p is a pair consisting of a permutation n £ Sb and a linear transformation 
T G GL{V) which satisfies (p{n{b)) = x{<p{b)) for all b £ B. The set of symmetries {7l,x) of is a 
subgroup of Sb x GL{V). We require that the projection to B of the group of symmetries is transitive. 
In other words, that for any bi,b2 £ B there exists a symmetry (tt, t) of ^ satisfying n{b\ ) = Z?2. 

It is not hard to verify that the third condition is intrinsic to the structure of V and does not depend on the 
specific choice of spanning set (0„). In our applications it follows easily from the overall symmetry of the 
setup. 

However, we also have a fourth assumption which is more technical than the others. First, we require 
that {(l)a)aeA forms a basis of V. This impUes that for any a G A, we may express Ca, the unit vector with 1 at 
its fl'th coordinate, as a linear combination of the form Y.beBCb<P{b)- We call any such linear combination an 
isolating combination for a. We assume that for each a G A, there are many isolating combinations supported 
on disjoint subsets of B. Moreover, we require the coefficients of these combinations to have small norm 
and to be rational with a small common denominator. This is the most difficult assumption to verify in our 
applications. Section |2] gives more details about all of these assumptions. 

Our main theorem shows that these four conditions yield the existence of a small solution of ([Til. 

Theorem (Main theorem - informal statement). Let B be a finite set and let V be a vector space of functions 
from B to Q which contains the constant functions. If there exists a basis {^a)aeA ofV, consisting of integer- 
valued functions, which satisfies the boundedness, symmetry and isolation conditions above. Then there is 
a small subset T C B such that 

I-* I teT I-* I beB 
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for all f in V. 

We note that the size A'^ = | T | of the subset obtained must satisfy the divisibility condition above. The 
existence theorems for orthogonal arrays, f-designs and f-wise permutations follow by showing that for the 
choice of B and V detailed in Sections [l . 1 1 through 11.31 there exists a choice of basis {0^} and small N for 
which all four conditions above hold. 

1.5 Related work 

In the probabilistic formulation Q of our problem we seek a small subset T C B such that the uniform 
distribution over T simulates the uniform distribution over B with regards to certain tests. There are two 
ways to relax the problem to make its solution easier, and raise new questions regarding explicit solutions. 

One relaxation is to allow a set T with a non-uniform distribution /i. For many practical applications 
of f-designs and f-wise permutations in statistics and computer science, but not quite every application, this 
relaxation is as good as the uniform question. The existence of a solution with small support is guaranteed 
by Caratheodory's theorem, using the fact that the constraints on n are all linear equalities and inequalities. 
Moreover, such a solution can be found efficiently, as was shown by Karp and Papadimitriou [KP82] and 
in more general settings by KoUer and Megiddo [KM94J. Alon and Lovett HALlll give a strongly explicit 
analog of this in the case of f-wise permutations and more generally in the case of group actions. 

A different relaxation is to require the uniform distribution on T to only approximately satisfy equation 
(111). Then it is trivial that a sufficiently large random subset T C B satisfies the requirement with high 
probability, and the question is to find an explicit solution. For instance, we can relax the problem of f-wise 
permutations to almost t-wise permutations. For this variant an optimal solution (up to polynomial factors) 
was achieved by Kaplan, Naor and Reingold IIKNR05I . who gave a construction of such an almost f-wise 
permutation of size n'^^'\ Alternatively, one can start with the constant size expanding set of 5„ given by 
Kassabov IIKas07ll and take a random walk on it of length 0{tlogn). 

1.6 Paper organization 

We give a precise description of the general framework and our main theorem in Section |2] We apply it to 
show the existence of orthogonal arrays and f-designs in Section[3] The case of f-wise permutations requires 
a detour to the representation theory of the symmetric group, and we defer it to the full version of this 
paper. The proof of our main theorem is given in Section |4] We summarize and give some open problems 
in Section |5] 

2 Main Theorem 

Let B be a finite set and let V be a vector space of functions from B to Q. We ask for conditions for the 
existence of a small set T C B for which ([T]) holds. Our theorem uses the following notation. 

For abasis (0a)aeA (where A is some finite index set) ofV we define : B — Z'* by = (paib). This 
definition is extended linearly to (p : ^ lA by setting 0(7) = LfoeB7fe0(^)- In the same manner, a set 
r C B is identified with its indicator vector so that <p{T) = 0(0- Finally, we recall from Section [L4l 
that a symmetry of is a pair 71 G and T G GL{V) such that <p{n{b)) = T{<p{b)) for all b in B. We now 
state formally our main theorem. 

Theorem 2.1 (Main Theorem). Let B be a finite set and V be a vector space of fiinctions from B to Q which 
contains the constant functions. Suppose that there exist integers m,co > 1, real numbers ci,C2,c^ > and 
a basis {(l)a)aeA ofV consisting of integer-valued functions such that: 
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Divisibility: |^0(B) is an integer vector. 
Boundedness: ||0(ft)||2 <cifor all b £ B. 

Symmetry: For each b\,b2 € B there exists a symmetry (71, t) of such that Ti{b\) = ^2- 
Isolation: For any a G A there exist vectors 7i , . . . , 7,- G lP for r > \B\/c2 such that 

• {ji) — ^ ' ^afor all i G [r]. 

• The vectors 71 , . . . , 7^ have disjoint supports, where the support of a vector y £lP is the set of 
coordinates on which it is nonzero. 

• II7II2 < ci, for all i G [r]. 

Then there exists a subset T C B with \T\ < poly (|A|,m,co,ci, 02,03) such that 

^L/W = ^I/W for all f in V. 

M I teT \"\ beB 

We prove Theorem 12.11 in Section |4] A careful examination of the proof shows that we can choose 
\T\ =N for any N > I which satisfies the following constraints: 

• Com divides N; 

• N> ■ max(m^ \A\^m^log^{\A\mcoCiC2C3), |A|^4c^cf log^(|A|mcoCiC2C3)); 

• N<0{^/\B\). 

Of course, if the parameters are so large so that the second and third conditions contradict each other, then 
our theorem remains trivially true by taking T = B. 

3 Applications 

In this section we apply our main theorem, Theorem l2.1[ to prove the existence results for orthogonal arrays 
and f-designs. Theorems 11.11 and 11.21 The existence result for f-wise permutations, Theorem 11.31 is more 
complicated because it requires a discussion of the representation theory of the symmetric group. We defer 
it to the full version of this paper. 

3.1 Orthogonal arrays 

We use the choice of B and V described in Section [TT] and recall the definition Q of the functions /(/ ^j of 
that section. We note that for every subset / we have T,ve[q]' f(i.v) = 1- Thus V contains the constant functions 
as Theorem 12. 1 [ requires. We start by choosing a convenient basis for V of integer- valued functions. Recall 
that the alphabet is [^] = {1, . . . ,^7} and let — 1] = {1, ... ,17 — 1} be all symbols other than q. Extend the 
definition Q of /(/ ^,) to apply to all subsets / with |/| < t and v G [q]'. Here, we mean that /(g 0) is the 
constant function 1 . Finally, let 

A:={{I,v):\I\<t,ve[q-lf} 
and for a = (/, v) G A set ■= f{i,v)- 
Claim 3.1. The span of the functions {^a}aeA is V. 
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Proof. Clearly 0a G V for all a G A. To see that {^a}aeA spans V , we will show that any with |/| < t 
and V G [q]' is spanned by We do this by induction on the number of elements in v which are equal 

to q. First, if v G [q— 1]^ then /(/,,) = 0(/ y). Otherwise, let / = {/i, . . . ,/,} with r < v G [q]' and assume 
WLOG that v;i = ^. Then 

9-1 

/(/,v) = /({ (2,... ,(',-}, (v,2,...,V,-,)) 2-1 J{I,{m,Vi^,...,Vir)) 

and by induction, the right hand side belongs to the linear span of {<^a}aeA- D 

Recall that : B — > Z'* is defined as <^{b)a = <pa{b)- We now choose integers m,co > 1 and real numbers 
ci ,C2,C3 > such that the conditions of divisibility, boundedness, symmetry and isolation required by Theo- 
rem l2.1l are satisfied. First, let a = (/, v) G A. Note that ^0(6)^ = ^^'^L Thus we set cq = q' so that 

is an integer vector. Second, we clearly have for any b £ B that 110(^)112 = L/=o CD ^ ('^ + Hence we 
setci = {n + \y/^. 

Third, to witness the symmetry condition, fix x G [q]" and consider the permutation n £ Sb given by 
n{b) = b + x (mod 17). We need to show that there exists a linear map T acting on V such that <p{n{b)) = 
x{^{b)) for all b £ B. This holds since for a = (/, v) G A we have 

^{K{b))a=fi.vib+x{mod q)) = fuv-^ (mod q){b) 

and // v-.v (mod 9) G y is in the linear span of {<^a}aeA by Claim I3H 

The fourth condition we need to verify is the existence of many disjoint isolation vectors for each a G A. 
Note that this condition also implies that {(l>a}aeA is a basis for V . This is established in the following lemma. 

Lemma 3.2. Let a £A. There exist disjoint vectors 71, . . . , 7^ G with r > \B\/{q'rP'') and \\ji\\2 < 2^'l'^n' 
such that ^{ji) = Sa- 

We prove Lemma [l!2] in two steps. First we fix some notations. Let K C [n] be of size \K\ < t, and let 
K'^ = [n] \ K. For x G [q]" let x|^ G [q]'^ be the restriction of x to the coordinates of K. Abusing notation, we 
also think of x\k G [q]" by setting coordinates outside K to zero. Note that in this notation, fi^v{x) = l{x\i = 
v}. We define the vector 5x^k £ as 

JGK 

where we recall that for b £B,eb £ {0, 1}^ is the corresponding unit vector. Note that if = then 5^ = Cx- 
Claim 3.3. Let a = (/, v) G A. Then 

( ifK<lI 

I 1 ifa\K=x\K 

Proof. We compute the value of ^{5x,k) in coordinate a = (/,v) G A. We have 

<?'(5.,^)«= I(-l)I^H'll{(W)l/ = v}. 

Suppose first that K L Then there exists j £ K\L Flipping the j'-th element in / doesn't change the 
expression 1{(x|7ua'' )l/ = ^} ^nd hence the alternating sign sum cancels. We thus assume from now on that 
KCLWe thus have 

mx\juK^)\i = v} = l{x\j = v\k andx|/fcn/ = v|s:4. 
This expression evaluates to 1 only if 7 = A" and x|/ = v. □ 
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We next prove Lemma [3^ showing that we can build many disjoint isolation vectors for any a G A. The 
proof uses the vectors S^ k we just analyzed. 

Proof of Lemma [O] Fix a = (/,v). Let x £ [q]" be such that x\j = v. We will construct a vector Yx,i such 
that ^{Yxj) = ^a- We will do so by backward induction on |/| < t. If |/| = f we take 

Yx,i ■= SxJ, 

and if |/| < ? we construct recursively 

Yx,i'=Sxj- y^^K- 

K^I,\K\<t,XKe[q-\\l^ 

It is easy to verify using Claim [33] that indeed ^{Yx,i) = as claimed. We further claim that ||7a:,/||2 < 
2^/^(2n)'^l^l . This clearly holds if |/| = t. If |/| < ? we bound by induction 

t 

||7r,/||2 < ||4,/||2+ £ £ WYxMh 

k=\I\ + lKDl,\K\=k 
\ k=\I\ + l J 

V k=\i\+i J 

To conclude, we need to show that by choosing different values for x such that x|/ = v we can achieve 
many disjoint vectors which isolate a. The key observation is that jxj is supported on elements b £ B 
whose hamming distance from x is at most t. Thus, if we choose xi, . . . ,x,. E [q]" such that = v and 
such that the hamming distance between each pair Xi,Xj is at least 2t + 1, we get that jxiJ, Yx,J have 
disjoint supports. We can achieve r > q"^' /n^' by a simple greedy process: choose x\,...,Xr iteratively; 
after choosing x, delete all elements in [g]" whose hamming distance from x, is at most 2t. Since the number 
of these elements is bounded by Y^L\ (") < the claim follows. □ 

We now have all the conditions to apply Theorem l2. 1[ We have \A\ = X^-^g (") 1)' ^ (^('^ + l))^co = 
^',ci = («+ 1)'/^,C2 = q'rP'' ,C3 = and m = 1. Hence we obtain that there exists an orthogonal array 

T C [q]" of strength t and size \ T\< {qnY for some universal constant c > 0. 

3.2 Designs 

In this section, we prove Theorem 11.21 It suffices to prove the theorem for k > 2t, since if ^ < 2f then the 
complete design (the design containing all subsets of size k) establishes the theorem. We use the choice of 
B and V described in Section 11.21 and recall the definition ^ of the functions fa of that section. We set 
A = [^^'] and note that Y^aeAfa = [f] and thus V contains the constant functions as Theorem [2?T] requires. As 
a convenient basis for V of integer-valued functions, we take {<^a}aeA with 0^ = fa- By definition, {(^a}aeA 
spans V and the fact that {^a]aeA is a basis for V will be implied by showing the isolation condition of 
Theorem 12. II 

We choose integers m,C() > 1 and real numbers ci,C2,C3 > to satisfy the conditions of divisibility, 
boundedness, symmetry and isolation in Theorem |2!T] First, = (^)/(^^') • (1, . . . , 1) and hence we set 
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Co = (j) so that is an integer vector. Second, ||0(Z7)||2 < |A| < v' . Hence we set ci = v'^^. Third, the 

symmetry condition also follows simply: let a G 5,, be a permutation on [v]. It acts naturally on B and A (by 
permuting subsets of [v]) and gives two permutations n £ Sb and ft £ Sa that satisfy <p{n{b))a = 0(^)7r-i(a)- 
The linear transformation T G GL{V) then corresponds to the permutation 7t^^. 

Finally, we need to show that for each a G A there exist many disjoint vectors which isolate it. This is 
accomplished in the following lemma. 



Lemma 3.4. Assume k > 2t. For any a € A there exist vectors y\,. . . ,Yr £lP with r > \B\/{vk)'^' such that 
ki 
(k-t)\ 



(l>{Yi) = Y ■ Ca- Moreover, ji^ ... ^y^ have disjoint supports and \\yi\\2 < {Ik)^' 1^ for i £ [r]. 



We will need the following technical claim for the proof of Lemma [331 In the following we consider 
binomial coefficients f ") =0 whenever n <m. 

Claim 3.5. Let a> b>0 and c > 0. Then 

Proof. Let f{a,b,c) = LUi-^Y Q Tb) ■ If b,c > we have (^+') = (^-^+') + and hence 

f{a,b,c) = f{a,b,c— 1) +f{a,b— l,c — 1). So, it is enough to verify the claim whenever b = or c = 0. 
lfb = then f{a,0,c) = Ito(-l)'( •) = since a>l. If c = then f{a,b,0) = •) il) 



Proof of Lemma l3A\ Let a G A = [^'] be a coordinate we wish to isolate. Let x G [^] be a set disjoint from a 
and let < J < f. Define dx^aj £ to be the indicator vector for all subsets b e B = [^] such that b CaUx 



and \anb\ = j, that is 



We define vectors ya^ G as 



Sx,a,j ■= 52 

bGaUx,\b\=k,\anh\=j 



We will shortly show that 



k\ 



ik-t)\ 

First we bound the norm of y^^a and show the existence of many disjoint vectors. It is easy to check that 
llT^.alb < {2k)^'/^. Also, the vector is supported on coordinates y G B such that \yr\x\ >k — t. Thus, if 
we choose xi,...,Xr G S such that n.xy| < k — 2t — I we get that the vectors 7xi,a, • • • , yx^.a have disjoint 
support. We can choose r > \B\/{vk)-^' by a simple greedy argument: choose xi,... ,x,. iteratively, where in 
each step after choosing Xj we remove all subsets y £ B whose intersection with Xj is at least k — It. The 
number of subsets eliminated in each step is at most (vk)^' hence we will get r > \B\/{vk)^' . 

To conclude the proof, we need to compute 0(7x,«). Let a' G A. Clearly if a' aUx then <p{yx,a)a' = 0. 
We thus assume that a' C aUx. Let i = \ana'\ where < i <t. We have that (p {5x^a,j)a' = if j < £, and 
that 

^{8x,a,j)a' = \{b£B -.a CbCaLlx,\anb\ = j}\ 
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Hence we have that 

lt-i\ Ik-t+t 



hy.,). = J^_ii-^y-^^kp- (9) 



J 



If a' = a then ^{Yx,a)a = k\/{k — t) \ as claimed. To conclude we need to prove that if a' 7^ a then ^{Yx.a)a' = 0. 
We have i=\aria'\ < f and let s = f - ^ > 0. Thus 



,(r ) ( iyJ^y( IV WH"^ 

Hr.,a)a' - (-1) (^_,_1)!L(-1V— (Y)- 

_ jk-iy J. /s\/k-i-i-f 
-^-'^ (.-.-i)!e:;),&-^) .-1 
_ {k-iy - /A A_^_l_,+/ 
We now apply Claim [375] with a = s,b = s — l,c = k — i — I — s and conclude that <p {Yx.a)a' =0. □ 

We are now ready to apply Theorem 12. II We have |A| = (J^) ,co = (J) ,ci = v'/^,C2 = (yk)^' ,03 = {2ky^^ 
and m = k\/{k — t)\. Thus the theorem implies the existence of a f — (v, ^ , A ) design T C B with | T | < v'^' for 
some universal constant c > 0. 



4 Proof of Main Theorem 

We prove Theorem 12. 1 1 in this section. We recall the settings: B is a finite set and V is a vector space of 
functions from B to Q. We assume the space V is spanned by integer valued functions {^a '■ B — >■ '^}aeA, 
where A is a finite index set. We also assume that the constant functions belong to V. 

The proof strategy is conceptually simple: choose T randomly and show that this choice is successful 
with positive probability. Let A'^ be the target size of T, to be chosen later. Let each ^ G B be chosen to be 
in T independently with probability p := N/\B\. Identifying T with its indicator vector in {0, 1}^, we have 
that Tb G {0, 1} with F[Tb = I] = p. Define X = (^{T)^Z^ and note that E[X] = p -0(6). In order to prove 
Theorem 12. II we need to show that 

P[X = E[X]] > 0. (10) 

We make two notes: first, since we assume that constant functions belong to V we have that if X = E[X] 
then in particular |X| = p\B\ = N. Second, in order for (ITOl ) to hold we must have that E[X] is an integer 
vector. Thus, we must choose N to be divisible by cq. 

The difficulty with establishing (ITOl ) comes from the fact that we require A different events to occur 
simultaneously: for all a G A we require that Xa = EfX^]. To better explain the challenge, consider momen- 
tarily for simplicity the case where <^{b) G {0, 1}'* for all G B and that for each a G A, FheB[<^{b)a = 1] = ^ 
(that is, all columns of ^ have qB ones). Then each individual is binomially distributed, Xa ~Bin(|B|,p^), 
and it is not hard to see that 

P[X, = E[XJ]«^. 

y/qN 

However, we need the events Xa = E[Xa] to occur simultaneously for all a G A. The problem arises because 
these events are dependent, and general techniques for handling such dependencies (for example, the Lovasz 
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local lemma) only work when each event depends only on a few other events (which is not the case here) 
and where each event holds with sufficiently high probability (which is also not the case here). What we 
show is that, under the conditions of Theorem [ZT] if we choose A'^ large enough (but only polynomially large 
in |A|,m,co,ci,C2,C3) then all the events Xa = E[X„] become essentially independent, and we show that 

F[X = nx]] « llF% = « (^)l^l. 

aeA V^^ 

The actual expression we get is somewhat more complicated as it also involves pairwise correlations between 
the different events X„, but conceptually it is of a similar flavor. 

Our main technique to study the distribution of the random variable X G Z'* is Fourier analysis. We 
recall some basic facts about Fourier analysis on Z^. 

Fact 4.1 (Fourier analysis on Z,^). Let X £ 1/^ be a random variable. The Fourier coefficients ofX live in 
the A-dimensional torus. Let T = [— 1 /2, 1 /2) denote the torus. The Fourier coefficients X[Q) for G T"^ 
are given by 

where {X,d) = Y,aeA^a^a- The probability that X = Xfor X dZA is given by the Fourier inversion formula 

F[X = X] = I X{d)e-^''''^^-^'>dd. 

Recall that our goal is to understand the probability that X = E,[X]. Applying the Fourier inversion 
formula for A = E[X] gives 

F[X = K[X]]= f f(0)e"2;r;(E[x],e)^0_ ^^^^ 

Thus, our goal from now on is to understand the Fourier coefficients of X. We first give an explicit formula 
for the Fourier coefficients. 

Claim 4.2. We have 

X{d) = Y[(.i-P + pe^'"-^^^'^''^). 
beB 

Proof. By definition X = <p{T) = Y,beB Tb^{b), where T}, G {0, 1} are independent with ¥[Ti, = \]= p. Thus 

= niETiJ^'''''''^''^^^^^'^^] = -/' + P^''"'^^^^^'^^)• 
iGB beB 

□ 

Clearly all Fourier coefficients of X have absolute value at most 1 . The first step is to understand the 
maximal Fourier coefficients of X, that is B for which = 1. 

Claim 4.3. LetL:= {e G T-* : X{e) = 1}. Then 

• If e (^L then |X(0)| < 1. 

• // G L, 0' G T-* then X{B + 6') = X{d'). In particular, L is a subgroup o/T^. 

Proof. Both claims follow immediately from the observation that G L iff ((^(Z?), 0) G Z for all G fi. □ 
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In fact, the isolation conditions in Theorem 12.11 imply that L is a discrete subgroup of T"^ (i.e. a lattice). 
Let M := {\/m- Z)'* be the lattice in T'* of all elements whose coordinates are integer multiplies of 1 /m. 
We show that L is a sublattice of M. 

Claim 4.4. L C M. 

Proof. Let 6 ^ L. We need to show that mOa G Z for all a G A. By the isolation condition of Theorem 12.11 
there exists y ^1? such that (^(7) = mea- Since d ^Lsns. have that (0 {b),d) G Z for all b £ B. Hence also 
(0(7), 0) G Z, i.e. mGa G Z as claimed. □ 

The first step we take is to approximate the Fourier coefficients of X near the lattice L. This will assume 
very little about (j), essentially only boundedness. The second (and more complex) step will be to show 
that all other Fourier coefficients are negligible, and in fact the contribution to (fTTI ) all come from Fourier 
coefficients near L. The second part will heavily utilize the symmetry of the map (p and the existence of 
many disjoint isolation vectors. Theorem 12. 11 then follows by a careful setting of parameters and a routine 
calculation. 

Formally, we will use £2 distance on T^. For ;c G T define its absolute value \x\ = \x (mod 1)| to be 
the minimal absolute value of x modulo 1 (that is, we take x (mod 1) G [—1/2, 1/2]). Define the distance 
between 0', d" G by 

d{e',d") := /Iie^-e^p. 

V aeA 

The distance between 6 and L C T"^ is given by 

d(d,L) ■=mmd(d,a). 

aeL 

The following three lemmas are the main technical ingredients of the proof. The first lemma gives a 
good approximation for the Fourier coefficients of X near zero (and by Claim 1431 near any point in L). 

Lemma 4.5 (Estimating Fourier coefficients near zero). Assume the conditions of Theorem \2.1\ and fix 
e < 0(1/ (ciN^/^)). Let d ^T^ be such that \\d\\2 < £■ Then 

X(0) = ^2;r,-(E[X],e)^-4;rVe^fie(l + 5) 

where R is the Ax A pairwise-correlation matrix of given by Ra\a" = llheB^il^)a'^ib)a"> '^nd where 
\5\=0{N'^/\B\+Nc\£^). 

The second lemma bounds the Fourier coefficients of X far from the lattice M. 

Lemma 4.6 (Bounding Fourier coefficients far from M). Assume the conditions ofTheorem \2.1\ Let G T"^ 
be such that d{G,M) > e. Then 

The third lemma bounds the remaining Fourier coefficients which are near M but far from L. In the 
following let M \ L denote the set of elements in M but not in L. 

Lemma 4.7 (Bounding Fourier coefficients near M but far from L). Assume the conditions of Theorem 12. 1 1 
and fix s < l/{lc\m). Let d ^T^ be such that d{6,M\L) < e. Then 

|X(e)| <exp(-jV- 9}^] ) . 

V m2|A|log(ci|A|)y 

We prove Lemmas [4.5114. 61 and 14 . 7 1 in Sections |4~T1 14. 21 and 14. 3[ respectively. We combine them to prove 
Theorem 12. ll in Section l4!4l 
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4.1 Estimating Fourier coefficients near zero 

Let dGT^be such that ||0||2 < e. We may assume that e < 0(1 / {ciN^^^)) otherwise the conclusion of the 
lemma is trivial. We decompose 

^-2nmx],e) ,2^e) = Yl fe-^^iPim-S) . ^i_p + pe^^i{m,e)^\ . (12) 
heB ^ ' 

Let V}y := {<p{b),6) where the inner product is taken over M. Since we assume ||0||2 < £ we can bound 
\Vb\ < II 0(^) lb II 6 lb < ci£ <^ 1. Thus we can approximate the terms in ([T2l ) by their Taylor series. The 
following claim gives a cubic approximation. 

Claim 4.8. Let f -.R^C be given by f{x) := e-'P^'il -p + pe'""). Then for \x\ < 1 we have 

f{x)=e-P^\\ + 8), 

where \5\ < 0{p^x^ + px^). 

Proof. We compute the cubic approximation for f{x) as a polynomial in p,x. In the following we use 
shorthand expression x = y + 0{z) for \x — y\ =0{z). We have 

f{x) = {\-p)e-'P' + pe'^'-P^^ 

= {l- p){l-ipx + {p'^x^ )) + p{\ + i{l-p)x-x^±0 {px^ +x^)) 
= \-px^ + 0{p^x^ + px^) 

= e-P''^ + 0{p^x^ +px^). □ 

We next apply the approximation given in Claim 148] to each of the terms appearing in (fT2l) . Summing 
up the errors, and using the fact that each term is bounded in absolute value by 1, we get that 

X{B)=e^'''^^^^^^^^e-^''''p-^''^'^b{i + 5) (13) 

where |5| < 0{p^Y,beB + PUbeB ^t)- ^'^ conclude the proof, note that 

heB heB 

where we recall thatRa^a" = llbeB'P{l^)a''P{b)a"- To bound the error term, recall that |Vi| < ci£ <C 1 hence 

|5| < 0(/|B| + p\B\{cief) = 0{N^/\B\ +Nc\e^). 

4.2 Bounding Fourier coefficients far from M 

Let G T'* be such that d{6,M) > e. Thus, there exists at least on coordinate 6^ whose distance from 
multiples of 1/m is at least £ / y^]A\. Otherwise put, there exists a G A such that 



\mda (mod 1)1 > em/^/\A\. (14) 
Recall that the Fourier coefficient X{d) is given by 



X{d) = lli\-p + pe- 

beB 



27li{(t){b),e)'. 
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Hence, to get a bound on essentially we need to show that {^{b),6) is far from integer for many 

beB. Note that we cannot longer assume, as in the proof of Lemma 1431 that {<^{b),d) is small in absolute 
value, since we assume no upper bound on ||0||2- Thus, it may be the case that {<^{b),Q) is large but still 
approximately integer. Let Vb '■= (mod 1) where |v;,| < 1/2. Our goal is to show that |v/,| is 

noticeably large for many values b £ B. This will then imply the required upper bound on |X(0)|. 

We will show this using the isolation vectors guaranteed by Theorem 12. 1[ Let 7 G be an isolation 
vector for a with modulus m; that is ^(7) = m • ea- We first show that it cannot be that Vh ^ for all 
b G Supp(7). 

Claim 4.9. Let y be such that <^{y) =m- Ca- Then 

2 2 

^ |v^|2>_^2!!_ 

66Supp(7) I'^lll7ll2 

Proof. Using the isolation property of 7 we get that 

£ YbVb (modl)= £ Yb{(l>{b),d) (modi) 
beSupp^y) feeSupp(7) 

= (0(7),0) {modl) = mda (modi). 
Hence by (fT4l ) we get that | LfoeSupp(7) Yh^h (mod 1)| > em/ \/\A\. On the other hand, we can bound 

I £ YbVb (mod 1)1 < I £ YbVbl < 

iGSupp(7) iieSupp(7) 

Combining the two bounds, we get that LfoeSupp7l^ftP ^ ^^"^^/l^l II7II2 claimed. □ 

We now use the assumption of Theorem 12.11 on the existence of many vectors which isolate a with 
disjoint support. Recall that by assumption we have r > \B\/c2 vectors 71, . . . , 7^ G such that: (1) each 7 
isolates a with modulus m; (2) The vectors 7i, . . . , 7r have disjoint supports; and (3) ||7|| < C3 for all / G [r]. 
Applying Claim [49] to each vector 7 independently we derive that 

£|v.p>£2|S|. J^. (15) 

To conclude the proof of the lemma, we apply (fT5l) to derive an upper bound on |X(0)|. The following 
claim is simple. 

Claim 4.10. Letp<\ /2 and \x\ < 1 /2. Then 

\l-p + pe^'"'-'] < exp{-px^). 
Applying Claim 14101 we derive the bound 

1^(6)1 = n 11 + ^ {-P I \^bA < exp f-e^N--^) . 

beB \ beB J V \A\C2C^J 
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4.3 Bounding Fourier coefficients near M but far from L 

Let d GT^be such that d{d,M\L) < e. That is, there exists a£M\L such that a) < s. Since a^L 
there must exist b* ^ B such that {^{b*),a) ^ Z. We will show using the symmetry of ^ that in fact this 
holds for many b £ B. Moreover, since a G M we have that if {<p (b) , a) ^ Z is must be at least l/ni far 
from the integers. This will allow us to give strong upper bounds on the Fourier coefficient X{a) and by 
continuity also onX{d). 

Let denote the lattice generated by {^{b) : b G B}. In other words, ^ is the subgroup of Z"^ whose 
elements are all possible integer combinations of {(p{b) : b £ B}. We first show that any subset of B which 
generates the lattice ^ must contain b for which {<p{b),ma) ^ 0. 

Claim 4.11. Let K C B be a set which generates the lattice Then there must exist b G K for which 
(0(^),ma) /O. 

Proof. By assumption since K generates the lattice , we can express as an integer combination of 

{^{b) : b G K\. That is, there exist integer coefficient a^, foxb £K such that 

beK 

Thus, as {^{b*),ma) / 0, there must exist G ^ for which {^{b),ma) / as well. □ 

We next claim that there must exist at least one small set K C B which generates We will later use 
symmetry to generate from it many such sets. 

Claim 4.12. There exists K C B of size \K\ < 0{\A\ log(ci |A|)) such that {<p{b) :b £K} generates the lattice 

Proof. Let be a minimal subset of B such that {0(^) :b £K] generates the lattice We claim that the 
minimality of K implies that all partial sums for K' (1 K must be distinct. Otherwise, assume that 

there exist two distinct subsets KijKjQK for which 0(^i) = ^(^'2). We can assume w.l.o.g that Ki,K2 are 
disjoint by removing common elements from both. Thus we have 

X m- 1 m=o. 

beKi beK2 

In particular, we can express any b' £ K^UKi as an integer combination of {^{b) : b £ K\{b'}}. Thus, 
we can remove b' from K and maintain the property that the resulting set generates This contradicts the 
minimality of K. 

We thus know that all sums {^{K') : K' C K} are distinct. We now apply the assumption that (j) is 
bounded. By the assumptions of Theorem 12. 11 we know that ||</>(^)||oo < ||0(^)||2 <ci. Hence we conclude 
that 

{^{K'):K' CK}Q[-ciK,ciKf, 

which imply that 

2^ < (2cii<:+l)l'*l. 

It is easy to verify that this gives the bound K < C?(|A| log(ci |A|)) as claimed. □ 

The next step is to use the symmetry of (j) to generate many small sets which span 

Claim 4.13. Let K C B be a set such that {(j){b) : b £ K} generates the lattice Let {n, t) G x GL(y) 
be a symmetry of Let A'^j := {K{b) : b G be a shift ofK by n. Then {^{b) : b G K^^} also generates the 
lattice . 
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Proof. Let b' G B. We need to show that we can express ^{b') as integer combination of {^{n{b)) : b G K}. 
Consider 7i^^{b'). By assumption the image of <p on elements of K generates the lattice Jf, hence there 
exist coefficients G Z for G A' such that 

H^-\b'))='£at(^{b). 

beK 

Applying the assumption that (tt, t) is a symmetry of (p we get that 

= HKiK-\b'))) = z{<p{n-\b'))) = £ a, • z{m) = E a,<p{m)- □ 

beK beK 

We combine Claims 14111 14021 and 1403] to derive that {(j){b),a) / for many b € B. Let B = {b £ B : 

{^{b),a)^0}. 

Corollary 4.14. |B|>a(^T^). 

Proof. Let K be the set guaranteed by Claim gUl where \K\ < 0{\A\ log(ci |A|)). Let iT;! = {K{b) :beK}. 
We know by Claim l4~T3l that for any symmetry (tt, t) of we have 

\Kj,nB\ > 1. 

Let H be the subgroup of permutations on B given by symmetries of <p. That is, H = {n : 
{7C,z) symmetry of We know by the assumptions of Theorem 12. 11 that H acts transitively on B. Thus, 
for any fixed ft G B, if we choose n € H uniformly we have that n{b) is uniformly distributed in B. Thus, 

E^eHlK^nB] = £ F^eH[n{b) £ B] = 

beK 1^1 

We thus conclude that we must have |B|>|S|/|^|. □ 

We conclude the proof of Lemma 14771 by establishing an upper bound of X{a). For any b £ B v/e have 
that {<p{b),a) (mod 1) / 0, hence since a G (1/m • Z)^ we have 

\{(l)ib),a) (modl)|>l/m. 

Recall that by assumption ||0(Z7)||2 < ci and ||a — 0|| < 1/ (2cim). Thus \ {<p{b),a — 6) \ < 1 /2m by Cauchy- 
Schwarz and we get that 

\{(t){b),d) (mod 1)1 > l/2m. 
We thus conclude with an upper bound on |X(0)|. Applying Claim l4~T0l we have 

\X{d)\ <U\^-p + pe''''^^^'^''^\<eM-p{l/2mf\B\)<cxp(-N- ^^^^j ) . 

V m^\A\log{ci\A\) J 

4.4 Proof of Theorem |2l] from Lemmas El |J6] and E?] 

We now deduce Theorem 12. 1 1 from Lemmas [4.51 |4.6| and |4.7[ Recall that we have 

F[X = E[X]]= [ X(0)£'-2^'<^M'^>J0. (16) 

Let N = poly(|A|,m,co,ci ,C2,C3) large enough to be chosen later. We would assume throughout that N is 
a multiple of cqm. If \B\ = 0{N^) then the set B is small to begin with, so assume that |5| ^> N'^. We set 
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£ « N^^/^ so that the conditions for Lemmas [4.51 and 147/1 hold. More explicitly, we set e := 0(1 /cxN^f^) so 
that the conditions for Lemma l4.5l hold with |5| < 1/2; and we assume thatN > Q.{nr') so that e < \/{2c\m) 
and the conditions for Lemma 1477] also hold. 

We decompose the integral in ( fT6l ) into three integrals: over points which are e close to L; over points 
which are e close to M\L; and over points which are £ far from M. Our choice of £ < 1 /2m also guarantees 
that balls of radius £ around distinct points in M are disjoint. We thus have that P[X = E[X]] = I^+Ij+h 
where 

JeeT'^:d{e,M)>e 

We first lower bound I\ . 
Claim 4.15. 

Proof. We first use the assumption that N divides com to reduce computing I\ to an integral around 0. We 
claim that the assumption that com\N implies that (E[X],a) S Z for all a G L. This is since this choice 
implies that all entries of E[X] are divisible by m since 

E[X] = = {N/cQm)-m~(l>{B) emZ"^. 

\B\ \B\ 

Moreover, since a £ L C M we have that ma G Z'^, hence (E[X],a) G Z. Combining this with Claim 1431 
which states that the Fourier coefficients of X are invariants to shifts by a G L, we deduce that 

h = \L\ I x(0)^'-2^'<^m'«)j0. 

>/0eT^:||0||2<e 

Recall that by Lemma [43] and our choice of parameters, if ||0||2 < £ then 

x(0)=x(0)(i + 5(e)) 

where X(0) =e2?r'(iEm,e)g-4;rVe^Ke ^^^^^ i^^^^j < ;^/2. Hence 

h = \L\f e-'''"P-'"''\l + 5)de. 

JeeT'^:\\e\\2<E 

Consider 

/; = |L| / ^--^^'p-^'^^c/e. 

JeeT'>:||e||2<e 

We claim that |/i | > |/{ | /2, hence it suffices to lower bound |/( | in order to lower bound |/i | . To see that, note 
that I[ is an integral of a real positive function; that we can always lower bound |/i | by its real part Re(/i); 
and that Re(l + 5) > 1 /2 since |5| < 1/2. Thus 

|/i I > Re(/i ) > Re(/i )/l = l[/l. 
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We next lower bound I[. Note first that we can bound d^Rd < Bc1\\d\\2- This is because 

e^Rd = l^{d,mf < I \\e\\l\\m\\l<Bcj\\d\\i 

heB heB 

Thus we get that 

/; > |L| / e-'^''^"''\\'\\idd. 

>/eeT-*:||e||2<e 

We bound I[ from below by the volume of the region in which the integrand is constant. This occurs 
whenever ||0||2 < e' = 0{\/{ciN^/^)). Recall that we chose e = 0(l/(ciA^'/^)) > e'. Hence the ball of 
radius e' is contained in the area over which we integrate, so we obtain the lower bound 

I/, I > /I /2 > 1^1 . 0(1) . V„l(Ball(0,e')) = L ■ (^) " = L • {^^^J^) " ^ □ 

The next steps are to bound I2 and Ii, from above. We bound them by the maximal value that can 
achieve in their integral domains. Lemma 1477] gives a bound on I2, 

m < max{lX(e)| : d(0,M\L) < s) < exp , 

and Lemma 1461 and our choice of £ = 0(1/ (ciN^^^)) gives a bound on 73, 

|73| < max{|X(0)| : J(0,M) > £} < exp (-Ne^ • = exp (-0{N'^') ■ f , 

We now need to choose large enough so that I\ » I/2I, I/3I. This can be accomplished since Ii decays 
polynomially with N, while I/2I, I/3I decay exponentially fast. It is not hard to verify that this is guaranteed 
whenever 

N > il(l) •max(AWlog^(mAcoCiC2C3),A^CiC2cf log^(mAcoCiC2C3)). 



5 Summary and open problems 

Our main theorem guarantees the existence of a small subset T C B for which ([T]) holds. The conditions 
we require are boundedness, divisibility, symmetry and isolation. The first three conditions seem natural 
for this type of problems, but the fourth seems artificial, as it depends on the specific basis we choose for 
V. Thus, we wonder if this condition can be removed. In particular, the following question captures much 
of the difficulty. Let G be a group that acts transitively on a set X. A subset T C G is X-uniform (or an 
X-design) if it acts on X exactly as G does. That is, for any x,y E X, 

^lU G r : g{x) =y}\ = ^^\{geG: g{x) = y}\ = ^. 

In our language we may take B = G and V to be the space spanned by all functions ^[x,y) : B — {0, 1} of 
the form (l)(x.y){b) = l{b{x)=y} for ^ ^- Then T is X-uniform if and only if ([T]) holds. Taking A to be 
some subset of X x X for which {<pa)aeA forms a basis of V, the boundedness, divisibility and symmetry 
conditions are clearly satisfied. However, it is not clear whether the isolation condition is satisfied as well. 
If indeed the isolation condition is redundant, one may conjecture that: 

Conjecture 5.1. Let G be a group that acts transitively on a set X. Then there exists an X-uniform subset 
T C G such that \ T\ < |X|'^/or some universal constant c> 0. 
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A second question is whether one can apply our techniques to get minimal objects. Recall that the 
size of the objects we achieve is only minimal up to polynomial factors. For example, one of the main 
open problems in design theory is whether there exists a Steiner system (i.e. a ^-design with A = 1) for 
any t > 5. Another major open problem of a similar spirit is the existence of Hadamard matrices of all 
orders n = 4m, or equivalently, 2- (4m — 1,2m — l,m — 1) designs. Empirical estimates for n <32 suggest 
that there are exp(C?(?i(log?i))) Hadamard matrices of order n = 4m. Since are so many of them, and since 
the logarithm of their number grows at a regular rate, we suspect that they exist for some purely statistical 
reason. However, the Gaussian local limit model seems to be false for Hadamard matrices interpreted as 
f-designs; it does not accurately estimate how many there are. 

A third question is whether there exists an algorithmic version of our work, similar to the algorithmic 
Moser fMos09 | and Moser-Tardos |MT10| versions of the Lovasz local lemma IIEL7 5 1, and the algorithmic 
Bansal [BanlO| version of the six standard deviations method of Spencer |Spe85 1. If an efficient randomized 
algorithm of our method were found, then we could no longer indisputably claim that we have a low- 
probability version of the probabilistic method. On the other hand it would be strange, from the viewpoint 
of computational complexity theory, if low -probability existence can always be converted to high-probability 
existence. Maybe our construction is fundamentally a low-probability construction. 
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